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Metalloporphyrins are ubiquitous in nature, particularly iron porphyrins (hemes) and magnesium 
dihydroporphyrins or chlorophylls. Oxovanadium (IV) complexes of alkyl porphyrins are widely 
distributed in petroleum, oil shales and maturing sedimentary bitumen. Here we identify new vanadium 
compounds in Venezuela Orinoco heavy crude oil detected by Fourier transform-ion cyclotron resonance 
mass spectrometry (FT-ICR MS). These compounds likely have the main structure of porphyrin, with the 
addition of more aromatic rings, thiophene and amino functional groups, corresponding to molecular series 
of C,H2n.4oN4ViOi (36^n^ 58),C,H2n.42N4ViOi (37 ^n^ 57),C,H2n.44N4ViOi (38 ^ n ^ 

59),CnH2n.46^4ViOi (43 ^ n ^ 54),CnH2n.48^4ViOi (45^11^ 55),CnH2n.38N4ViSiOi (36^11^ 

4aC^H2n.4oN4ViSiOi (35 ^n^ 5aC^H2n.42N4ViSiOi (36 ^n^ 54),C^H2n.44N4ViSiOi (41 ^ n ^ 

55),CnH2n.46N4ViSiOi (39^11^ 55),CnH2n.27N5ViOi (29^11^ 40),CnH2n.29^5ViOi (34^11^ 

42),CnH2n.33N5ViOi (31^11^ 38),CnH2n.35^5ViOi (32^11^ 41),CnH2n.27^5Vi02 (32^11^ 41) aud 
Cnli2n-29^5^i^2 (33 < 13 < 42). These findings are significant for the understanding of the existing form of 
vanadium species in nature, and are helpful for enhancing the amount of information on 
palaeoenvironments and improving the level of applied basic theory for the processing technologies of heavy 
oils. 



The reserves of heavy petroleum are 8.90 trillion barrels, much larger than the 1.64 trillion barrels of 
conventional crude oiP'^. Understanding of the molecular structures of heavy fossil feedstocks is valuable 
for their utilization, but characterization of these ultra-complex materials is very challenging. Vanadium 
compounds are present in heavy petroleum as porphyrins^. Petroporphyrins have been extensively studied since 
their discovery in crude oils and shales as "molecular fossils" by Treibs^ ^ They provide palaeoenvironmental 
information on the deposition environment^"^". A portion of the vanadium compounds give strong optical 
absorption in the Soret band at circa 400 nm, but the remainder do not, likely due to formation of complexes 
or due to chemical modification of the porphyrin ring^\ 

Six series of petroporphyrins have been identified in fossil fuels using mass spectrometry^^"^^; Qian et al.^^ 
successfully identified vanadyl porphyrins in unfractionated asphaltenes for the first time, and gave the primary 
evidence of cycloalkane-substituted and sulfur- containing vanadyl porphyrins. The more complex vanadyl 
compounds have not been identified. This is due to the low concentration of metalloporphyrins and the com- 
plexity of asphaltenes in heavy crude oil. Effective separation and ultra-high mass resolution are needed to resolve 
these vanadium compounds. 

The heavy crude oil used in our studies was obtained from the Orinoco Basin in Venezuela, which was of 
particular interest, including large accumulations of conventional and medium oil, while at the same time 
possessing an immense resource of both heavy oil and natural bitumen^ The sulfur and nitrogen elemental 
contents of the crude oil were 3.90 wt% and 0.74 wt%, respectively. The density was 1.03 g/cm^ at 20°C. 
Vanadium concentration was 513.31 wppm. The crude oil sample was subjected to solvent extractions and silica 
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Figure 1 | Expanded mass scale spectra of fractions from the positive-ion ESI FT-ICR MS analysis (a) For fraction M6 at m/z 602, the mass peak with a 
red star was one of the new vanadyl porphyrins with DBE=23, and the green stars show the derivative intermediate of chlorophyll or heme, which 
corresponding to [C33H34N4Vi04+H]^ and [C34H38N4Vi03 + H]^ described elsewhere^^; (b) For fraction M5 at m/z 660, the mass peak with a red star 
corresponds to [C39H36N4ViOiSi + H]^ with DBE=24; (c) For fraction M6 at m/z 587, the mass peak with a red star corresponds to 
[C34H4iN5ViOi+H]^ with DBE=17, the green stars show the isotopic mass peak of [C33H34N4Vi03 + H]^ and [C34H38N4V1O2+H] ^described 
elsewhere^^; (d) For fraction M6 at m/z 631, the mass peak with a red star corresponds to [C36H45N5V1O2+H] ^ with DBF = 17, the green star shows the 
isotopic mass peak of [C39H38N4ViOi + H]^ with DBE=23. 



gel chromatographic separations which have been described else- 
where^^. The vanadium-rich fractions, named M4, M5, and M6, 
respectively were investigated by positive-ion electrospray ionization 
(ESI) Fourier transform ion cyclotron resonance mass spectrometry 
(FT-ICR MS). All experiments were conducted on a Bruker apex- 
ultra FT-ICR MS equipped with a 9.4 T actively shielded supercon- 
ducting magnet. FT-ICR MS has the highest available broadband 
mass resolution, mass resolving power, and mass accuracy, which 
enables the assignment of a unique elemental composition to each 
peak in the mass spectrum^^'^°. 

In an earlier paper^^, in addition to the six known types of vanadyl 
porphyrins, etio porphyrins (ETIO), deoxophylloerythroetio 
porphyrins (DPEP), dicyclic-deoxophylloerythroetio porphyrins 
(Di-DPEP), Rhodo-etio porphyrins (Rhodo-ETIO), Rhodo-deoxo- 
phylloerythroetio porphyrins (Rhodo-DPEP), and Rhodo-dicyclic- 
deoxophylloerythroetio (Rhodo-Di-DPEP), three kinds of vanadyl 
porphyrins corresponding of molecular formula CnHniN4V02, 
CnHinN4V03, CnHinN4V04, respectively, were detected by pos- 
itive-ion ESI FT-ICR MS in the Venezuela Orinoco heavy crude oil 
for the first time. These formulae were consistent with intermediate 
derivatives of chlorophyll or heme, with functional groups of carbo- 
nyl and/or carboxyl at the periphery of porphyrin structures. This 



evidence for CnH^N4Vi02, CnH^N4Vi03, and CnH^N4Vi04 class 
species in crude oil adds further support to the hypothesis that pet- 
roporphyrins were derived from chlorophylls and hemes by indi- 
cating intermediate structures. 

In this study, we found that some vanadium compounds existed 
with the main structure of porphyrin, but combined with other func- 
tional groups containing oxygen atoms, sulfur atoms and nitrogen 
atoms. In addition, the evidences of ultra-high resolution mass spec- 
trometry and the possible structures were also provided. The vana- 
dium compounds with O, S and N should associate more strongly 
with the asphaltenes than the less polar components. Formation of 
complexes with other components would shift and attenuate the 
Soret absorption in UV-visible spectroscopy. 

Results 

Ultra-high resolution mass spectra for the new vanadium com- 
pounds. Figure 1 shows the close-up view of expanded mass scale 
spectra of fractions M5 and M6. Figure 1 (a) is for fraction M6 at m/z 
602, which was internally calibrated with [C43H7iNi+H]^. Mass peaks 
were assigned base on the high mass resolution and mass accuracy. 
The mass peak marked with a red star was one of the new vanadyl 
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Figure 2 | The comparison chart between the real and calculated mass spectra, (a) For fraction M6 at m/z 602, 603; (b) For fraction M5 at m/z 660, 661; 
(c) For fraction M6 at m/z 587, 588; (d) For fraction M6 at m/z 631, 632. 



porphyrins [C37H34N4ViOi + H]+ with DBE=23, and the green 
stars indicate the derivative intermediate of chlorophyll or heme, 
which correspond to [C33H34N4Vi04+H]^ and [C34H38N4 
Vi03+H]^ described elsewhere^^. Common vanadyl porphyrins 
contained six types of DBE (Double Bond Equivalence), from 17 
to 22 shown in Supplementary Figure 1. In this work, the 
CnHnaN4ViOi compounds with DBE=24, 25, 26, and 27 were also 
detected, which was shown in Supplementary Figure 2. 

Figure 1 (b) is for fraction M5 at m/z 660, which was internally 
calibrated with the known adjacent peak of [C48H69Ni+H]^. The 
mass peak with a red star was the [C39H36N4ViOiSi + H]^ with 
DBE =24, its mass error was only 0.07 mDa. These series of vana- 
dium compounds with sulfur atoms show continuous distribution. 
The CnHinN4ViOiSi series with DBE =22, 23, 25, and 26 were also 
detected and shown in Supplementary Figure 3 and 4. 

Figure 1 (c) and (d) is for M6 at m/z 587, 631, which were intern- 
ally calibrated with [C42H67Ni + H]+ and [C45H75Ni + H]+. The 
mass peak with a red star in Figure 1 (c) was the [C34H41N5 
ViOi + H]^ with DBE=17, which had a mass error of only 
0.01 mDa. These series of vanadium compounds with five nitrogen 
atoms show the same DBE values with ETIOs. The mass peak with a 
red star in Figure 1 (d) corresponds to [C36H45N5Vi02+H]^, which 
has a DBE value of 17. These series also show the same DBE with that 
of ETIOs. That means if these series of compounds contain the 
similar structure of vanadium porphyrin, the excess nitrogen atom 
and oxygen atom should locate on the side-chains. The N and O 



adducted species were extensive, the CnHmNsViOi compounds with 
DBE= 18, 20, and 21; CnHn,N5Vi02 compounds with DBE= 18 were 
detected and shown in Supplementary Figure 5 and 6. 

Comparison of the isotope ratio between the real and calculated 
mass spectra. Identification of these vanadium compounds were 
performed by assigning the spectrum peaks to accurate mass 
values and isotopic masses, and by observing their characteristic 
serial distribution at the large mass range. An exact mass match 
(within 0.5 mDa) is not sufficient to unambiguously identify the 
presence of vanadium compounds. The isotope ratio is critical to 
confirm the identification in addition to matching molecular 
mass^^. Figure 2 shows the comparison chart between the real and 
calculated mass spectra, which covers these new series of vanadium 
compounds and their substitution for one ^^C, the theoretical 
prediction of the isotope distribution generated by Bruker DA 
software. For example. Figure 2 (a) shows the isotope ratio of 
[C37H34N40iVi + H]^, the mass error is only 0.02 mDa, both of 
the real and calculated isotope rations for [^^CC36H34N40i 
Vi + H]^ are 0.40. Therefore, good agreements were not only 
found in the accuracy mass, but also in the isotope ratios. They 
were the powerful and significant evidences for these new 
vanadium compounds. 

Types and distributions of these new vanadium compounds. All of 

these new series vanadyl porphyrins show continuous distributions. 
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Figure 3 | Iso-abundant plots of double bond equivalents (DBE) as a function of carbon number for each type of vanadium compounds derived from 
positive-ion ESI FT-ICR MS. (a) For the vanadyl porphyrins in fraction M5, containing eleven types with DBFs from 17 to 27; (b) For the vanadium 
compounds with one sulfur atom in fraction M5, containing five types with DBFs from 22 to 26; (c) For the vanadium compounds with five 
nitrogen atoms and one oxygen atom in fraction M6, containing four types with DBFs, 17, 18, 20, and 21 respectively; (d) For the vanadium compounds 
with five nitrogen atoms and two oxygen atoms in fraction M6, containing two types with DBF of 17 and 18. 



Figure 3 shows the iso-abundant plots of DBE as a function of carbon 
number for each type of vanadium compounds derived from 
positive-ion ESI FT-ICR MS. The red line in Figure 3 (a) is the 
absolute upper limit of planar aromatic compounds as found in 
petroleum, as opposed to curved or fullerene structures which have 
not been detected^\ These new series of vanadium compounds 
detected in Venezuela heavy crude oil are divided into three 
classes. (1), with the basic porphyrin structure and high DBEs, 
corresponding to molecules CnH2n-4oN4ViOi (36 ^ n ^ 

58XCnH2n-42^4yiOi (37 ^ H ^ 57),Cr,H2n-44N4ViO i (38 ^ H ^ 

59),CnH2n-46N4ViOi (43 ^ n ^ 54j,and CnH2n-48^4yiOi (45 < n 

< 55); (2), with the basic porphyrin structure and one more sulfur 
atom, corresponding to molecules CnH2n-38^ 4^1^101 (36 ^ n ^ 

41),Cr,H2n-4oN4ViSiOi (35 ^ H ^ 51),Cr^H2n-42^4y l^lO i (36 < fl 

< 54),Cr,H2n-44N4ViSiOi (41 ^ H ^ 55XCnH2n-46^4y iSfii (39 < 

n < 55); (3), with five nitrogen atoms, one or two oxygen atoms 
corresponding to molecules CnH2n-27N5ViOi (29 < n < 

40) ,CnH2n-29^5yiOi (34 ^ H ^ 42),CnH2n-33^ sV lO i (31 ^ H ^ 
38),CnH2n-35^5yiOi (32 ^ fl ^ 41),CnH2n-27^ sV 1O2 (32 ^ fl ^ 

41) and CnH2n-29^5yi02 (33 ^fi^ 42). 



Discussion 

Based on the accurate molecular weight and the DBEs, the experi- 
ments of collision induced dissociation (CID) were conducted to 
determine the structures of these new compounds. Supplementary 
Figure 7 and Figure 8 were the extended mass spectrums of CID 
experiments. In Supplementary Figure 7, [C44H48N40iVi + H]^ 
and [C41H42N4O1S1 Vi + H] ^ could still be detected with the increas- 
ing of collision voltages (CV) from — 1.5 eV to —20 eV, —30 eV, we 
proposed the reasonable structures of these new compounds, which 
was shown in Figure 4 (a) and (b). They are the vanadyl porphyrins 
containing more fused aromatic rings and the functional groups of 
thiophene, which existed stably. These new vanadium compounds 
with sulfur atoms may be generated from organic sulfur in the source 
kerogens, or could be added by thermochemical sulfate reduction 
(TSR) /bacterial sulfate reduction (BSR) during the process of pet- 
roleum generation^^"^^, which would convert side chains into con- 
densed aromatic rings. In Supplementary Figure 8, the mass peak of 
[C35H43N5Vi02 + H]^ at m/z 617 was isolated with the window 
width of 1 Da under the model of CID, following by the spectra 
collecting at eight different collision voltages (CV). The peaks at 
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Figure 4 | The reasonable structure of new class species of vanadium compounds in Venezuela Orinoco crude oil detected in purified fractions. 

(a) : For the vanadyl porphyrins in fractions M5 and M6, with different DBFs from 23 to 27, containing more benzene rings linked the porphyrin ring; 

(b) : For the vanadium compounds with one sulfur atom in fractions M5 and M6, with different DBFs from 22 to 26, containing the function group of 
thiophene; (c): For the vanadium compounds with five nitrogen atoms and one oxygen atom in fractions M5 and M6, with different DBFs, 17, 18, 20, and 
21 respectively, containing the function group of amino; (d): For the vanadium compounds with five nitrogen atoms and two oxygen atoms in fractions 
M5 and M6, with different DBFs, 17 and 18, containing the function groups of amino and ether. 



m/z 530 and 516, corresponded to the vanadyl porphyrins 
[C3iH34N4ViOi + H]+ and [C3oH32N4ViOi + H]+, with loss of the 
functional group of C4H9N1O1 due to breakage of a side chain, which 
had a DEE of zero, that meant there was no double band in 
C4H9N1O1. Hence, we suggested that the structure of C4H9N1O1 
contained the amine function group and/or the ether link, connected 
with the ETIO and DPEP porphyrin rings, rather than an amide 
which has a DEE of one. Figure 4 (c) and (d) shows the possible 
structures of the new vanadium compounds with five nitrogen atoms 
and one or two oxygen atoms. Hodgson^^ gave preliminary evidence 
for protein fragments associated with porphyrins, based on which 
the structures are reasonable. These new compounds contain N, S, 
and O atoms which would enhance aggregation with asphaltene 
molecules in heavy oils. 

To verify these possible structures, the molecular level structural 
optimization had been investigated using the density functional the- 
ory (DFT) of quantum chemical method, calculating at the E3LYP 
and E3LYP/LanL2DZ/6-31 G++ level of theory by Gaussian soft- 



ware. The calculation results showed that these possible structures of 
new vanadium compounds could be existed stably, which were 
shown in Supplementary Figure 9 and Supplementary Table 1 and 
Table 2. 

In summary, we have found sixteen new series of vanadium com- 
pounds in Venezuela heavy crude oil and provided the evidences of 
ultra-high resolution mass spectrometry. The suggested structures 
are significant for the better understanding of the existing form of 
vanadium compounds in the heavy fossil fuels, and initiate the recog- 
nition of the broad range of porphyrins that can occur. 

Methods 

Sample Pretreatment. Venezuela Orinoco heavy crude oil sample was obtained from 
the Petro China Liaohe Petrochemical refinery. The crude oil sample was subjected to 
solvents and silica gel chromatographic separations described specifically elsewhere^*^. 
Briefly highlight the points of separation method here, firstly, the oil sample was 
dissolved in chloroform, followed by adding silica gel to form a slurry mixture; after 
evaporating at room temperature in a fume hood, the remaining oil/silica gel mixture 
was transferred into the thimble of Soxhlet extractor; then the Soxhlet extractions 
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were performed using methanol and toluene sequentially as solvents for 40 h and 
24 h, respectively, yielding the methanol solubles and toluene solubles. The methanol 
soluble fraction was separated into various subfractions by introducing the methanol 
soluble fraction on top of silica gels in a glass column and sequentially eluting with 
solvents of increasing polarity to yield various silica gel chromatography subfractions, 
named Ml to M7. The toluene soluble fraction was divided into nC7 insolubles and 
solubles. The nC7 solubles were fractionated into various subfractions by introducing 
the nC7 solubles on top of silica gels in a glass column and sequentially eluting with 
solvents of increasing polarity to yield various silica gel chromatography subfractions, 
named Tl to T7. 

The vanadium concentrations in the silica gel chromatographic subfractions of 
methanol soluble and toluene soluble were determined by graphite furnace atomic 
absorption spectrometer (GFAAS, Beijing Puxi General Analytical Instrument Co. 
Ltd. TAS990) . The results showed majority of vanadium compounds were enriched in 
M4, M5 and M6 of methanol soluble subfractions, 21 16.9 wppm, 31 13.6 wppm, and 
4380.2 wppm, respectively. The UV-vis spectra of silica gel chromatographic M4 to 
M6 subfractions showed the characteristic UV-vis absorption band for vanadyl 
porphyrins at the Soret band of 4 10 nm, (3-bandsof 533 nm, and a-band of 572 nm^^. 
Therefore, M4, M5, and M6 subfractions were investigated by ESI FT-ICR MS. 

ESI FT-ICR MS Analysis. Ten milligrams of crude oil sample and its fractions were 
diluted with 1 mL of toluene. Two to fifteen micro-liters of each diluted sample was 
further diluted with 1 mL of toluene/methanol (1:1, v/v) solution to yield 0.02 to 
0.15 mg/mL solutions. Five micro-liters of formic acid were added to the solutions 
prior to the positive-ion ESI FT-ICR MS analysis. A Bruker apex-ultra FT-ICR MS 
equipped with a 9.4 T actively shielded superconducting magnet was used. 

The analytes were infused through an Apollo II electrospray source at 180 |iL/h 
using a syringe pump. The operating conditions for positive ion formation were 
—4.0 kV emitter voltage, —4.5 kV capillary column front end voltage, and 320 V 
capillary column end voltage. Ions accumulated for 0.1 s in a hexapole with 2.4 V DC 
voltage and 500 Vp-p RF amplitude. The quadrupole (Ql) was optimized to obtain a 
broad range for ion transfers. An argon-filled hexapole collision cell was operated at 
5 MHz and 700 Vp-p RF amplitude, ions accumulated for 0.6 s and collision voltage 
was set to — 1.5 eV. The extraction period for ions from the hexapole to the ion 
cyclotron resonance cell was set to 1.5 ms. The rf excitation was attenuated at 
1 1.75 dB. A 4M datasets were acquired for a corresponding mass range of 200 Da to 
1000 Da. A total of 128 scans were co-added to enhance the signal-to-noise ratio and 
dynamic range. 

ESI FT-ICR MS Data Processing. The FT-ICR MS was internally calibrated using a 
Ni class homologous series which were [CnH2n-i7Ni + H]"^ and [CnH2n-i9Ni + H]"^. 
The internal quadratic calibration was also performed. Peaks with relative abundance 
greater than six times the standard deviation of the baseline noise level were exported 
to a spreadsheet. Data analysis was performed by selecting a two-mass scale- 
expanded segment in the middle of the mass spectrum, followed by the detailed 
identification of each peak. The peak of at least one of each heteroatom class species 
was arbitrarily selected as a reference. Species with the same heteroatom class and 
their homologs with different double bond equivalent (DBF) values and carbon 
numbers were searched within a set of 0.002 Kendrick mass defect tolerance. The 
details of data analysis procedure have been described elsewhere^^. 
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